AITopics

Country: Europe > Germany (0.28)

Industry:

Education > Curriculum (0.41)
Education > Educational Technology > Educational Software > Computer Based Training (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Neural Information Processing SystemsApr-26-2026, 00:24:49 GMT

56c51a39a7c77d8084838cc920585bd0-Paper.pdf

learner, machine learning, reinforcement learning, (17 more...)

Country: Europe > Germany (0.28)

Industry:

Education (1.00)
Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)
(2 more...)

Neural Information Processing SystemsMar-22-2026, 21:16:49 GMT

Improving Environment Novelty Quantification for Effective Unsupervised Environment Design

Unsupervised Environment Design (UED) formalizes the problem of autocurricula through interactive training between a teacher agent and a student agent. The teacher generates new training environments with high learning potential, curating an adaptive curriculum that strengthens the student's ability to handle unseen scenarios. Existing UED methods mainly rely on, a metric that measures the difference between the agent's optimal and actual performance, to guide curriculum design. Regret-driven methods generate curricula that progressively increase environment complexity for the student but overlook environment -- a critical element for enhancing an agent's generalizability. Measuring environment novelty is especially challenging due to the underspecified nature of environment parameters in UED, and existing approaches face significant limitations. To address this, this paper introduces the (CENIE) framework. CENIE proposes a scalable, domain-agnostic, and curriculum-aware approach to quantifying environment novelty by leveraging the student's state-action space coverage from previous curriculum experiences. We then propose an implementation of CENIE that models this coverage and measures environment novelty using Gaussian Mixture Models.

artificial intelligence, name change, proceedings, (8 more...)

Industry: Education (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.59)

Neural Information Processing SystemsFeb-8-2026, 18:32:28 GMT

56c51a39a7c77d8084838cc920585bd0-Paper.pdf

curriculum strategy, demonstration, learner, (15 more...)

Country:

Europe > Germany > Saarland > Saarbrücken (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Education (1.00)
Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Neural Information Processing SystemsDec-24-2025, 03:43:13 GMT

Curriculum Design for Teaching via Demonstrations: Theory and Applications

We consider the problem of teaching via demonstrations in sequential decision-making settings. In particular, we study how to design a personalized curriculum over demonstrations to speed up the learner's convergence. We provide a unified curriculum strategy for two popular learner models: Maximum Causal Entropy Inverse Reinforcement Learning (MaxEnt-IRL) and Cross-Entropy Behavioral Cloning (CrossEnt-BC). Our unified strategy induces a ranking over demonstrations based on a notion of difficulty scores computed w.r.t. the teacher's optimal policy and the learner's current policy. Compared to the state of the art, our strategy doesn't require access to the learner's internal dynamics and still enjoys similar convergence guarantees under mild technical conditions. Furthermore, we adapt our curriculum strategy to the setting where no teacher agent is present using task-specific difficulty scores. Experiments on a synthetic car driving environment and navigation-based environments demonstrate the effectiveness of our curriculum strategy.

curriculum design, demonstration, name change, (7 more...)

Industry: Education > Curriculum (0.44)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.98)

arXiv.org Artificial IntelligenceNov-11-2025

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Zhu, Speed, Cai, Jianwei, Chen, Guang, Wu, Lulu, Yang, Saiyong, Zhou, Wiggin

Recent reasoning-first models (e.g., OpenAI o1, DeepSeek R1) have spurred a resurgence of interest in RLVR. Nevertheless, advances are dominated by mathematics (e.g., AIME), with competitive-programming code generation underexplored and data curation receiving less attention than RL algorithm design. We investigate how to construct RLVR datasets (i.e., RL prompts) and present practical training techniques that yield strong performance on competitive-programming code generation. Our pipeline begins with supervised fine-tuning (SFT) distilled from strong open-source models, augmented with general-purpose and reasoning-intensive data. RL then follows a two-stage process with executable, testcase-driven rewards: first, training on a large, uniformly distributed set of competitive-programming problems using Group Relative Policy Optimization (GRPO) with 8 rollouts per prompt and a relatively short response-generation window (e.g., 32k during SFT and 24k in this stage) to expand entropy and mitigate repetition and truncation; second, we perform \textbf{Pre-GRPO}: updating on a small, high-quality set of challenging problems with a large rollout budget (64 rollouts per prompt) under a hard-focus curriculum that continuously retains the most difficult instances throughout training. We implement our method on Qwen2.5-32B and evaluate on LeetCode and Codeforces weekly contests to avoid data leakage. The resulting model achieves state-of-the-art performance among models of similar scale and is comparable to leading systems such as DeepSeek v3.1 and Doubao-1.5-Thinking. We also examine scaling trends and observe strong RL scaling on an internal large-scale MoE model. Our study distills concise best practices for data curation, entropy expansion, and curriculum design in RLVR for competitive-programming code generation.

arxiv preprint arxiv, large language model, machine learning, (21 more...)

2511.06307

Genre: Research Report (0.82)

Industry: Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Automatic Programming (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceNov-3-2025

Scale-Aware Curriculum Learning for Ddata-Efficient Lung Nodule Detection with YOLOv11

Luo, Yi, Guo, Yike, Hooshangnejad, Hamed, Ding, Kai

Lung nodule detection in chest CT is crucial for early lung cancer diagnosis, yet existing deep learning approaches face challenges when deployed in clinical settings with limited annotated data. While curriculum learning has shown promise in improving model training, traditional static curriculum strategies fail in data-scarce scenarios. We propose Scale Adaptive Curriculum Learning (SACL), a novel training strategy that dynamically adjusts curriculum design based on available data scale. SACL introduces three key mechanisms:(1) adaptive epoch scheduling, (2) hard sample injection, and (3) scale-aware optimization. We evaluate SACL on the LUNA25 dataset using YOLOv11 as the base detector. Experimental results demonstrate that while SACL achieves comparable performance to static curriculum learning on the full dataset in mAP50, it shows significant advantages under data-limited conditions with 4.6%, 3.5%, and 2.0% improvements over baseline at 10%, 20%, and 50% of training data respectively. By enabling robust training across varying data scales without architectural modifications, SACL provides a practical solution for healthcare institutions to develop effective lung nodule detection systems despite limited annotation resources.

artificial intelligence, curriculum, machine learning, (19 more...)

2510.26923

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

arXiv.org Artificial IntelligenceJul-28-2025

Adaptive Learning Systems: Personalized Curriculum Design Using LLM-Powered Analytics

Li, Yongjie, Nong, Ruilin, Liu, Jianan, Evans, Lucas

--Large language models (LLMs) are revolutionizing the field of education by enabling personalized learning experiences tailored to individual student needs. For example, the curriculum reinforcement learning approach tailored for quantum architecture search effectively enhances computational efficiency in noisy environments by leveraging an optimized simulator [16]. The expectations and attitudes of students and teachers towards learning analytics are pivotal for effective implementation in higher education, as highlighted by recent assessments [22]. The framework's efficacy was confirmed through thorough Consequently, the personalized curriculum dynamically evolves to reflect each learner's progress, ensuring optimal The system continuously evaluates the learner's engagement Let us denote the student's performance metrics as We focus on leveraging the embeddings generated by the LLMs to classify learning materials and identify optimal pathways for students. Additionally, we assess model performance using metrics such as Learner Engagement Scores (LES) and Knowledge Retention Rates (KRR) across the implemented curriculum.

large language model, machine learning, natural language, (16 more...)

2507.18949

Country:

Asia (0.28)
North America > United States (0.14)

Genre:

Research Report (1.00)
Instructional Material (0.88)

Industry:

Education > Curriculum (0.85)
Education > Educational Technology > Educational Software > Computer Based Training (0.50)
Education > Assessment & Standards > Student Performance (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Lee, Jin Hwa, Lampinen, Andrew K., Singh, Aaditya K., Saxe, Andrew M.

Distinct Computations Emerge From Compositional Curricula in In-Context Learning

arXiv.org Artificial IntelligenceJun-17-2025

In-context learning (ICL) research often considers learning a function in-context through a uniform sample of input-output pairs. Here, we investigate how presenting a compositional subtask curriculum in context may alter the computations a transformer learns. We design a compositional algorithmic task based on the modular exponential-a double exponential task composed of two single exponential subtasks and train transformer models to learn the task in-context. We compare (a) models trained using an in-context curriculum consisting of single exponential subtasks and, (b) models trained directly on the double exponential task without such a curriculum. We show that models trained with a subtask curriculum can perform zero-shot inference on unseen compositional tasks and are more robust given the same context length. We study how the task and subtasks are represented across the two training regimes. We find that the models employ diverse strategies modulated by the specific curriculum design.

large language model, machine learning, natural language, (17 more...)

2506.13253

Country: North America > United States (0.46)

Genre: Research Report (1.00)

Industry: Education > Curriculum (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Neural Information Processing SystemsMay-27-2025, 21:17:57 GMT

Improving Environment Novelty Quantification for Effective Unsupervised Environment Design

Unsupervised Environment Design (UED) formalizes the problem of autocurricula through interactive training between a teacher agent and a student agent. The teacher generates new training environments with high learning potential, curating an adaptive curriculum that strengthens the student's ability to handle unseen scenarios. Existing UED methods mainly rely on regret, a metric that measures the difference between the agent's optimal and actual performance, to guide curriculum design. Regret-driven methods generate curricula that progressively increase environment complexity for the student but overlook environment novelty -- a critical element for enhancing an agent's generalizability. Measuring environment novelty is especially challenging due to the underspecified nature of environment parameters in UED, and existing approaches face significant limitations.

effective unsupervised environment design, environment novelty quantification, unsupervised environment design, (6 more...)

Industry: Education (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.61)